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Abstract. Vertical transport by moist sub-grid scale pro- 
cesses such as deep convection is a well-known source of 
uncertainty in CO 2 source/sink inversion. However, a dy- 
namical link between vertical transport, satellite based re- 
trievals of column mole fractions of CO 2 , and source/sink 
inversion has not yet been established. By using the same of- 
fline transport model with meteorological fields from slightly 
different data assimilation systems, we examine sensitivity 
of frontal CO 2 transport and retrieved fluxes to different pa- 
rameterizations of sub-grid vertical transport. We find that 
frontal transport feeds off background vertical CO 2 gradi- 
ents, which are modulated by sub-grid vertical transport. The 
implication for source/sink estimation is two-fold. First, CO 2 
variations contained in moist poleward moving air masses 
are systematically different from variations in dry equator- 
ward moving air. Moist poleward transport is hidden from 
orbital sensors on satellites, causing a sampling bias, which 
leads directly to small but systematic flux retrieval errors 
in northern mid-latitudes. Second, differences in the rep- 
resentation of moist sub-grid vertical transport in GEOS -4 
and GEOS-5 meteorological fields cause differences in ver- 
tical gradients of CO 2 , which leads to systematic differences 
in moist poleward and dry equatorward CO 2 transport and 
therefore the fraction of CO 2 variations hidden in moist air 
from satellites. As a result, sampling biases are amplified and 
regional scale flux errors enhanced, most notably in Europe 
(0.43 ± 0.35 PgC yr -1 ). These results, cast from the perspec- 
tive of moist frontal transport processes, support previous ar- 
guments that the vertical gradient of CO 2 is a major source 
of uncertainty in source/sink inversion. 


1 Introduction 

Measurements of atmospheric CO 2 mixing ratio contain in- 
formation about land and ocean carbon sinks, which act 
as natural buffers against rising fossil fuel emissions. Flux 
inversion methods combine information from winds, CO 2 
measurements, and surface flux estimates to infer the size and 
distribution of these sinks (e.g., Gurney et al., 2002). Contin- 
uous records residing in the continental boundary layer close 
to terrestrial ecosystems allow quantitative flux estimation at 
finer spatial scales then previously possible (e.g.. Law et al., 
2003; Peylin et al., 2005; Zupanski et al., 2007; Lauvaux et 
al., 2009; Schuh et al., 2010), while spaced-based measure- 
ments of total column CO 2 help fill critical gaps in the in-situ 
network (Baker et al., 2010; Chevallier et al., 2010). Nev- 
ertheless, model transport error, specifically related to sub- 
grid scale vertical transport, remains a well-known but poorly 
characterized source of uncertainty in source/sink inversions 
of surface and column CO 2 data (Denning et al., 1999; Yi et 
al., 2004; Yang et al., 2007; Stephens et al., 2007; Houweling 
et al., 2010; Chevallier et al., 2010; Liu et al., 201 1). 

In the era of satellite measurements, an additional com- 
plication arises because the strongest horizontal gradients in 
CO 2 occur along frontal boundaries that are typically hidden 
from orbital sensors by clouds (Corbin and Denning, 2006), 
leading to systematic sampling errors of up to 1 .5 ppm at sea- 
sonal scales (Corbin and Denning, 2008). Because a signif- 
icant portion of the synoptic signal is correlated with moist 
processes and therefore likely to be unobserved by satellites 
(Parazoo et al., 2011), covariance of moist transport with sur- 
face CO 2 flux will cause errors in top-down flux estimates if 
not represented correctly in transport models. 
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In this study we quantify differences in forward calcula- 
tions of moist frontal CO 2 transport and examine the im- 
pact on source/sink inversion. Forward simulations are run 
using a common tracer transport model and identical sur- 
face fluxes, but with Uansport driven by meteorological fields 
from four versions of the National Atmospheric Space Ad- 
ministrations’ Goddard Earth Observing System Data As- 
similation System (GEOS-DAS). The four resulting simu- 
lations are analyzed in terms of eddy and mean meridional 
mass fluxes of CO 2 , which are calculated using eddy decom- 
position (Parazoo et al., 2011). This approach characterizes 
moist frontal CO 2 transport into a single term, which can be 
used as a diagnostic for inter-model comparison. Uncertainty 
in frontal transport is quantified as the spread in forward cal- 
culations of eddy Uansport. 

Unique aspects of forward simulations include the use of 
an identical tracer model, identical surface fluxes, and ana- 
lyzed meteorological fields derived from a common general 
circulation model (GCM). Conversely, intercomparison stud- 
ies that use a wider set of models are often unable to account 
for sensitivity to a wide range of factors, including differ- 
ences in dynamical core, meteorological analysis fields, co- 
ordinate systems, and surface fluxes. While no forward simu- 
lation of CO 2 is flawless, here we are at least able to account 
for sensitivity of moist frontal CO 2 transport to an isolated 
set of factors, specifically horizontal grid spacing and sub- 
grid scale vertical Uansport due to moist convection and ver- 
tical diffusion. 

We also examine the impact of model differences in moist 
frontal transport on the inversion of total column CO 2 re- 
trievals from satellite remote sensing insttuments using Ob- 
servation System Simulation Experiments, or OSSE’s. Sev- 
eral OSSE’s are presented in order of increasing complica- 
tion, first to establish baseline flux reUieval errors due to op- 
timization and data screening, then to examine the degree 
to which signal detection is possible, and finally to evaluate 
sensitivity of signal detection to model transport. Each OSSE 
has two primary steps: (1) the nature run, in which synthetic 
retrievals of column CO 2 are created by sampling forward 
simulations according to the Greenhouse gases Observing 
SATellite (GOSAT) (e.g., Kuze et al., 2009), and (2) the in- 
version run, in which CO 2 fluxes are recovered from syn- 
thetic retrievals using ensemble optimization. 

OSSE’s are similar in design to Chevallier et al. (2010) 
in that one set of experiments use a common set of mete- 
orological fields for transport in Steps 1 and 2 and another 
set examines transport bias by using different meteorological 
fields in Steps 1 and 2. Several key differences are as follows: 
(1) the current study uses the same Uansport model in all ex- 
periments, eliminating sensitivity to differences in numerical 
integration; (2) meteorological fields are derived from very 
similar data assimilation systems but using uniquely differ- 
ent representations of sub-grid vertical transport; (3) use of 
ensemble data assimilation framework and estimation of sys- 
tematic biases to component fluxes (technique described in 


Lokupitiya et al., 2008 and references therein); (4) a subset 
of end-to-end OSSE calculations are performed to quantify 
baseline errors in the inversion system, in particular due to 
temporal sampling errors, before errors related to transport 
uncertainty are analyzed; and (5) an attempt to understand 
whether differences in retrieved fluxes can be atttibuted to 
specific dynamical differences in meteorological fields. This 
last point is key: rather than point out when and where flux 
retrieval errors occur, we attempt to explain flux errors from 
a dynamical viewpoint. We note that despite increasing com- 
plexity, OSSE’s are highly simplified; for example, the effect 
of changes in aerosol optical depth and land surface type on 
XCO 2 retrievals is not considered. Given the similarity of ex- 
periments yet uniqueness of methods, Chevallier et al. (2010) 
provide an excellent benchmark for comparison. 


2 Methods 

2.1 Forward Simulations 

Global CO 2 Uansport is analyzed in the Parameterized 
Chemistty and Transport Model (PCTM, see Kawa et al., 
2004). Surface fluxes, described in Parazoo et al. (2008), 
are the same for each simulation, and include air-sea ex- 
change, constant in time fossil fuel emissions, and a steady 
state terrestrial biosphere. These fluxes are also used as “pri- 
ors” in Step 2 of the source/sink inversion experiments (see 
Sect. 2.3). PCTM is run from 1 January 2000 through 31 De- 
cember 2004 to spin up atmospheric gradients of CO 2 and 
create a common initial condition, and from 1 January 2005 
through 3 1 December 2005 using the four reanalysis prod- 
ucts (described below) to comprise the analysis period. 

Transport in the PCTM is computed off-line using 
archived meteorological analyses from different versions of 
the GEOS DAS. Two distinctly different GEOS analyses. 
Versions 4 and 5, are used in this study. These meteorolog- 
ical analyses differ in (1) physical parameterizations in the 
GCM; (2) native resolution; (3) the algorithm used for the 
meteorological analysis; and (4) the manner in which the 
analyses are assimilated with the GCM. The physical param- 
eterizations in the GEOS-4 GCM are substantially different 
from GEOS-5 (see below). GEOS-4 has a native resolution of 
1.25° x 1° (longitude x latitude) and 55 layers while GEOS- 
5 has a native resolution of 0.67° x 0.5° and 72 layers. Most 
of the additional layers in GEOS-5 are in the troposphere. 
This study uses meteorological datasets saved at the native 
resolution of each of these forecast models. We also use two 
additional datasets that are saved at a reduced resolution, giv- 
ing a total of four meteorological datasets based on the same 
dynamical core to transport CO 2 . The different analysis tech- 
niques are likely to impact simulations, but isolating this fac- 
tor is difficult in an offline framework. We therefore focus on 
the impact of horizontal grid spacing and sub-grid transport 
on forward simulations of CO 2 transport, keeping in mind 
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Table 1. Description of meteorological datasets used in offline tracer transport simulations. Column 1 (Data) is the code name for the 
meteorological dataset, 2 (Version) is the version of the analysis product, 3 (Native Resolution) is the resolution at which the analysis system 
(forecast + assimilation) is run (axb xc = latitude spacing x longitude spacing x number of vertical levels), 4 (Transport Resolution) is the 
grid spacing at which the analysis is saved, 5 (Full or Reduced) refers to whether the analysis is saved at the native resolution of the parent 
GCM or interpolated to a coarser resolution, 6 (Deep Convection) is the deep convection scheme of the parent GCM, and 7 (Turbulence) 
is the PBL turbulence scheme of the parent GCM. Naming convention for analysis product in the first column is as follows: character 1-2 
is the product version number (G4 = GEOS-4 and G5 = GEOS-5), character 3 is the grid at which the analysis is saved (F = full resolution 
and R = reduced resolution), and character 4—5 is the latitude grid spacing at which the analysis is saved for offline transport (05 = 0.5°, 
10 = 1 . 0 °, 20 = 2 . 0 °). 


Run 

Version 

Native 

Resolution 

Transport 

Resolution 

Full or 
Reduced 

Deep 

Convection 

Turbulence 

G5F05 

5.1.0 

0.5° x 0.67° 
x 72 

0.5° x 0.67° 
x 42 

Full 

Moorthi and 
Suarez (1992) 

Louis et al. (1982) 
Lock et al. (2000) 

G5R10 

5.2.0 

0.5° x 0.67° 
x 72 

1.25° x 1.0° 
x 42 

Reduced 
(regridded 
from G5F05) 

Moorthi and 
Suarez (1992) 

Louis et al. (1982) 
Lock et al. (2000) 

G4F10 

4.5.3 

1.25° x 1.0° 
x 55 

1.25° x 1.0° 
x 25 

Full 

Zhang and 
McFarlane (1995) 

Holtslag and 
Boville (1993) 

G4R20 

4.5.3 

1.25° x 1.0° 
x 55 

2.5° x 2.0° 
x 25 

Reduced 
(regridded 
from G4F10) 

Zhang and 
McFarlane (1995) 

Holtslag and 
Boville (1993) 


sensitivity to other factors. The key differences between these 
models are summarized in Table 1 and specified in more de- 
tail below. 

The GEOS-4 analysis (Bloom et al., 2005) procedure uses 
the Physical-space Statistical Analysis Scheme of Cohn et 
al. (1998), which produces an optimal combination of six- 
hour model forecasts and observations at the observation lo- 
cations. These are interpolated to the model grid and the 
model background fields (surface pressure, winds, temper- 
ature and moisture) are replaced with the analyses every six 
hours. Six-hour time-averaged GEOS-4 fields are used in the 
present study (Pawson et al., 2007). Physical parameteriza- 
tions in GEOS-4 are drawn from the National Center for At- 
mospheric Research Community Climate Model, Version 3 
(CCM3) package (Kiehl et al., 1998), which include deep 
convection (Zhang and McFarlane, 1995), shallow convec- 
tion (Hack, 1994), and Planetary Boundary Layer (PBL) tur- 
bulence (Holtslag and Boville, 1993). 

GEOS-5 analyses (Rienecker et al., 2008) use a three- 
dimensional variational approach in gridpoint space (Grid- 
point Statistical Analysis, GSI, Wu et al., 2002) that directly 
assimilates satellite radiances alongside the conventional me- 
teorological data. These analyses are input smoothly into the 
GEOS-5 GCM, using the incremental analysis update (IAU) 
approach of Bloom et al. (1996): this involves adding addi- 
tional forcing terms to the momentum, thermodynamic and 
moisture tendencies, which are the local (gridpoint) forces 
needed to drive the background forecasts to the analyses 
over the six-hour window of the assimilation. This method 
of merging the analyses to the model leads to smoothly vary- 


ing fields in the assimilation, which means that the transport 
in GEOS-5 is much smoother than in GEOS-4. The GEOS-5 
GCM maintains the finite-volume dynamics used in GEOS- 
4 (Lin, 2004) and is integrated with physics packages un- 
der the Earth System Modeling framework (e.g., Collins et 
al., 2005), including the Relaxed Arakawa-Schubert (RAS) 
scheme for convection (Moorthi and Suarez, 1992) and sep- 
arate PBL turbulent mixing schemes for stable (Louis et al., 
1982) and unstable (Lock et al., 2000) conditions. 

2.1.1 G4F10 

The first analysis used in this study is the GEOS-4 DAS, as 
implemented by Kawa et al. (2004) in the original PCTM 
study of CO 2 transport. Meteorological fields from GEOS- 
4 DAS are saved every 6h at the native resolution of 1.25° 
by 1°, with 55 hybrid vertical levels up to 0.01 hPa. Only 25 
levels are used for transport in PCTM, although all levels in 
the troposphere are retained. This re-analysis driver data will 
be referred to as G4F10. 1.25° by 1° transport in PCTM is 
run with a 7.5 min time step. 

2.1.2 G4R20 

G4F10 is regridded horizontally to 2.5° by 2° to study 
transport at coarser grid spacing. Vertical mixing by moist 
convection and turbulent diffusion is identical to G4F10. 
Vertical and horizontal advection is conserved, leading to 
smoothed spatial gradients. All 25 vertical levels are re- 
tained, and transport fields are saved at 6-hourly resolution. 
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Time-stepping through PCTM is doubled to 15 min. This re- 
gridded version of G4F10 is referred to as G4R20. 

2.1.3 G5F05 

The third product is based on version 5 of GEOS-DAS 
(GEOS -5 DAS, Rienecker et al., 2008). Native grid spac- 
ing and sub-grid vertical mixing are key differences from 
GEOS-4. This first GEOS-5 product is from Version 5.1.0, 
which was run for the period 1 October 2003-2 October 2008 
in support of NASA’s science missions. The native grid of 
GEOS-5 DAS (the grid at which the analysis is performed) 
is 0.67° by 0.5° in the horizontal with 72 layers to 0.01 hPa, 
31 of which are in the troposphere. The 72 vertical layers are 
reduced to 42 levels while retaining the 3 1 tropospheric lev- 
els. Instantaneous transport fields are saved at 6-hourly reso- 
lution. Reduced grid spacing requires a time step of 3.75 min. 
This driver data is referred to as G5F05. 

2.1.4 G5R10 

The fourth product is similar to G5F05 except with a newer 
version (5.2.0) of GEOS-5 DAS used for the Modern-Era 
Retrospective analysis for Research Applications (MERRA) 
(Rienecker et al., 2011). The system contains several im- 
provements from the GEOS-5. 1.0 system (Rienecker et al., 
2008), including tuning of the sub-grid physical packages 
and aspects of the GSI analysis system. Transport fields 
are re-analyzed (saved during a corrector segment of the 
Incremental Analysis Update rather than the analysis seg- 
ment) and saved every 3 h at a reduced horizontal resolu- 
tion of 1.25° by 1°. This facilitates comparison with the 
G4F10 dataset, although it should be noted that analyses with 
GEOS-5 performed at the 1.25° by 1° resolution differ in im- 
portant ways from the data that are interpolated to that res- 
olution. This re-analyzed driver data will be referred to as 
G5R10. 

2.2 Calculations of frontal CO 2 transport 

Eddy decomposition of CO 2 transport is described in more 
detail by Parazoo et al. (2011); here we provide a brief de- 
scription. Frontal CO 2 transport is diagnosed by parsing total 
column integrated meridional CO 2 transport as described by 
PCTM into eddy and mean components of the large-scale at- 
mospheric circulation. The eddy component arises from cor- 
related variations of mass flux and CO 2 mixing ratio. Frontal 
passage events are transient and migrating, tend to deviate 
strongly from the mean overturning circulation (e.g., Hadley 
Cell), and are therefore associated with strong variations of 
mass flux. When CO 2 gradients align with frontal air parcel 
trajectories along the meridional plane, the eddy component 
amplifies. These conditions are often satisfied in northern 
mid-latitudes (Keppel-Aleks et al., 2011). The vertical coor- 
dinate that describes frontal transport is also critical because 
variations of atmospheric fields are sensitive to trajectory. In 


order to capture moist ascent of air parcels along stormtracks 
(e.g., Pauluis et al., 2008), the analysis therefore relies on the 
calculation of eddy and mean transport on moist isentropic 
surfaces, which conserve energy as rising air condenses and 
releases latent heat. 

2.3 Synthetic inversion experiments 

The inversion system uses an ensemble-based approach to 
data assimilation referred to as the Maximum Likelihood En- 
semble Filter (MLEF; Zupanski et al., 2007; Lokupitiya et 
al., 2008). The strategy for flux estimation is based on the 
idea that high frequency variations in respiration and photo- 
synthesis are driven by relatively well-understood and eas- 
ily modeled processes that are determined to first order by 
variations in solar radiation (Zupanski et al., 2007), while 
slowly varying processes (e.g., nitrogen deposition) are typ- 
ically not modeled as well and lead to persistent biases in 
CO 2 exchange. We therefore prescribe hourly, synoptic, and 
seasonal variations in terrestrial CO 2 flux from models, and 
allow the inversion to solve for persistent biases at grid scale 
due to nitrogen deposition in northern mid-latitudes and CO 2 
fertilization in the tropics. 

Biases are assumed to be constant over the length of the 
data assimilation window. Lokupitiya et al. (2008) solved for 
8-week biases by assimilating synthetic surface data over the 
same period. This assimilation window was found to reason- 
ably recover fluxes given a sparse measurement network (Pe- 
ters et al., 2005). Although this window is short relative to 
the length of time needed to effectively capture signals from 
source regions (Bruhwiler et al., 2005), this extra informa- 
tion is mostly diluted by atmospheric mixing and comes at a 
greater computational cost. Satellite observing systems such 
as GOSAT greatly improve spatial coverage, and thus longer 
windows may not be required. Here, we consider a 2-week 
window. Given the short assimilation window and grid scale 
inversion, strong covariance smoothing is applied at the first 
cycle of MLEF, using e-folding length of 800 km over land 
points and 1600 km over ocean points. Further details regard- 
ing the assimilation scheme are discussed by Lokupitiya et 
al. (2008). 

GOSAT uses a sun-synchronous orbit with early afternoon 
sun-lit equator crossing time (~ 01 :30 p.m. LT) and orbital 
inclination near 98°. Synthetic GOSAT retrievals are gener- 
ated in forward simulations of the nature run (Step 1) us- 
ing pressure-weighted column averaging of vertical CO 2 pro- 
files. Since GOSAT measures CO 2 absorption using reflected 
solar radiation, PCTM is sampled only during daytime of 
the descending mode of orbit. Subsequent orbits are sep- 
arated by ~ 25° in longitude and ~ 99 min apart. GOSAT 
points near-nadir as well as at the sun glint spot, which 
greatly increases the signal over the ocean. We assume a 5- 
point cross-scan track, which was used on GOSAT between 
4 April 2009 and 31 July 2010, with footprints separated 
by ~ 158 km cross-track and ~ 1 52 km along track (Crisp et 
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Table 2. Summary of OSSE's used in source/sink inversions. Red shading refers to “perfect transport” experiments, green “signal detection”, 
and blue "biased transport”. The “true” flux refers to the set of surface CO 2 fluxes prescribed in Step 1 . The “prior” flux refers to surface 
fluxes used as a first guess in Step 2. Prior fluxes are described in Parazoo et al. (2008), and include a steady state terrestrial biosphere. This 
means that the true flux in Experiments 1, 2 and 5 is the same as the prior flux used in Step 2, while the true flux in Experiments 3, 4 and 6 
also includes a globally distributed 3 PgC sink (see Fig. 7a). 


Experiment 

Observation 
Transport (Step 1 ) 

Inversion 
Transport (Step 2) 

True Flux 

Cloud 

Screening 

1 

GEOS-4 

GEOS-4 

Prior 

No 

2 

GEOS-4 

GEOS-4 

Prior 

Yes 

3 

GEOS-4 

GEOS-4 

Prior + Constant 
3 PgC Sink 

Yes 

4 

GEOS-4 

GEOS-4 

Prior + Seasonal 
3 PgC Sink 

Yes 

5 

GEOS-5 

GEOS-4 

Prior 

Yes 

6 

GEOS-5 

GEOS-4 

Prior + Seasonal 
3 PgC Sink 

Yes 


al., 2012). Soundings are sampled at the native resolution of 
the meteorological analysis in the nature run (see below) at 
01:30 p.m. LT, and are assumed to represent the grid scale 
average. No temporal averaging of synthetic retrievals is ap- 
plied. A maximum of 281 points are sampled by GOSAT in 
one hour, corresponding to 94416 points over the 2-week as- 
similation period. All possible glint retrievals are retained, 
including those beyond ± 20° of latitude from solar declina- 
tion. In practice, however, glint mode is only used by GOSAT 
at latitudes within 20° of latitude from solar declination. We 
therefore run an additional OSSE in the signal detection ex- 
periment (Experiment 3, described below) to test for the im- 
pact of high latitude glint data on flux recovery. We prescribe 
a uniform uncertainty of 3 ppm to GOSAT retrievals; this is 
chosen as an upper bound from values computed by Cheval- 
lier et al. (2009) due to measurement noise, smoothing error, 
interference error component, and overall random contribu- 
tion of aerosols to retrieval noise. 

Clear conditions are defined as grid cells with cloud op- 
tical depth less than 0.3. Aerosol effects are not consid- 
ered in this study. Cloud optical depth is prescribed from 
MERRA and defined as the grid scale value (total in-cloud 
optical depth from ice and liquid water times the three di- 
mensional total cloud fraction in a grid box). MERRA es- 
timates of clear-sky ratio compare favorably with CALIOP 
estimates from Eguchi and Yokota (2008) in northern middle 
and boreal latitudes and in tropical regions. MERRA tends 
to overestimate the percentage of clear sky days in south- 
ern Africa, Austrialia, the mid-continental portion of south- 
ern S. America near 30° S, over the Southern Ocean, and the 
Arctic. Cloud screening leads to greater than 90% loss of 
data in most regions, especially in persistent cloudy areas in 
tropical and boreal latitudes. 

A total of six OSSE’s are reported in this study (see Ta- 
ble 2). Step 2 (inversion run) is identical for each OSSE, 
where MLEF is used for optimization, PCTM for offline 


transport, and G4R20 for meteorological forcing. Biases are 
therefore recovered at 2.5° x 2° resolution. Step 1 differs in 
the prescription of the true flux and meteorological forcing. 
OSSE’s 1 and 2 are “perfect transport” experiments, where 
G4R20 is used in Step 1. These experiments are simplified 
further in that the tme flux consists of a steady state terrestrial 
biosphere, and are therefore designed to establish baseline 
flux recovery errors related to ensemble optimization, ran- 
dom measurement error, measurement density reduction due 
to cloud screening, and temporal sampling biases. OSSE’s 
3 and 4 are “signal detection” experiments, which also use 
G4R20 in Step 1 but where the true flux includes the steady 
state terrestrial biosphere plus a slowly varying and glob- 
ally distributed 3 GtC sink. This sink represents the persis- 
tent bias in the steady state terrestrial biosphere as described 
above and in Zupanski et al. (2007). These experiments are 
designed to test whether slowly varying sinks due to poorly 
modeled processes such as nitrogen deposition and regrow- 
ing forests in northern middle latitudes and CO 2 fertilization 
in the tropics are recovered from synthetic satellite retrievals. 
Finally, OSSE’s 5 and 6 are “biased transport” experiments, 
where G5R05 is used in Step 1, and are therefore designed 
to quantify transport-driven flux errors. OSSE 5 uses the true 
flux as described in OSSE’s 1 and 2; OSSE 6 uses the true 
flux as described in OSSE 4. 

3 Results 

3.1 Transport differences 

Column integrated meridional CO 2 transport is shown in 
Fig. 1. Total column transport is approximately conserved 
between meteorological analyses. In the annual average, 
there is net northward transport north of 50° N and net south- 
ward transport south of 50° N (Fig. la); this pattern is domi- 
nated by winter transport (Fig. lb). The direction of eddy and 
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Fig. 1. Zonal-mean column integrated meridional CO 2 transport av- 
eraged over (A) 1 yr, (B) December-January-February and (C) June- 
July-August. Total meridional transport is shown in black, “mean” 
transport in blue, and “eddy” transport in red. Line styles cor- 
respond to transport by the four meteorological analyses, where 
G5F05 is cyan and solid, G5R10 as blue and dashed, G4F10 is green 
and dotted, and G4R20 is red and dash-dotted. 


mean CO 2 transport is independent of analyses. Eddy trans- 
port is on average poleward in both hemispheres and opposed 
by southward mean transport in northern latitudes. The di- 
rection of eddy and mean transport switches sign in summer 
north of 50° N (Fig. lc), consistent with strong CO 2 uptake. 
The magnitude of transport by eddy and mean circulations is, 
however, quite sensitive to the analyses. For example, eddy 
transport in northern mid-latitudes (40-60° N) is generally 
stronger and more poleward in GEOS-5, exceeding transport 
in GEOS-4 by 0. 1 PgC month - 1 in the annual mean and close 
to 0.2 PgC month - 1 during winter and summer. 

GEOS-4 models differ only in horizontal grid spacing; 
transport differences between G4R20 and G4F10, although 
small, are therefore explained by regridding of G4F10 to 
the coarser grid of G4R20. Different transport in GEOS-5 
models is also largely explained by differences in grid spac- 
ing, although data assimilation procedures and model physics 
updates between 5.1.0 and 5.2.0 (e.g., high latitude diurnal 



Fig. 2. Vertical profiles of (A) turbulent diffusion, (B) cumulus mass 
flux and (C) CO 2 mixing ratio, zonally averaged, binned into mid- 
latitudes (30-70° N) and plotted as a function of the hybrid coordi- 
nate eta (r/) for each of the four meteorological analyses (line styles 
correspond to Fig. 1). Annual averages are plotted in black, sum- 
mer averages in green. Mass fluxes from GEOS-4 are identical and 
therefore lie directly over one another. Note that vertical diffusion 
is zero in GEOS-4 at the lowest model level and cumulus mass flux 
is zero in GEOS-5 at the two lowest model levels. 


cycle) may also contribute. Transport differences between 
G4F10 and G5R10 are more complicated, and likely due to 
a combination of factors, including: (1) the representation of 
sub-grid scale processes, (2) horizontal/vertical grid spacing, 
(3) the new data assimilation system and (4) number and type 
of observational data assimilated. 

Sub-grid scale processes are probably easiest to examine 
because mass flux coefficients for cumulus convection and 
turbulent diffusion are saved in the reanalysis. Vertical mass 
fluxes are plotted as a function of the terrain following coor- 
dinate ij and averaged in northern mid-latitudes (30-70° N) 
in Fig. 2. Turbulent mixing (Fig. 2a) and cumulus convection 
(Fig. 2b) in GEOS-5 are consistently weaker than GEOS- 
4 throughout the column in northern mid-latitudes. Cumulus 
convection also starts higher in the PBL in GEOS-5 (Fig. 2b), 
reducing vertical transport from the surface. Weak vertical 
mixing in mid-latitudes in GEOS-5 is consistent with Ott 
et al. (2009), who found that single column model appli- 
cation of RAS-based moist convection significantly under- 
estimates convective mass flux relative to cloud resolving 
models for several case studies of mid-latitude convective 
storms, resulting in weaker vertical transport of trace gases. 
The findings of Ott et al. (2009) were subsequently used to 
tune several key parameters in the convective parameteriza- 
tion of the GEOS-5 GCM (S. Pawson, personal communica- 
tion, 2011). The effect of the tuning is apparent in the newer 
version of GEOS-5 (i.e., G5R10), where the vertical convec- 
tive mass flux is larger than G5F05 in mid-latitudes while 
preserving the vertical distribution. Weaker vertical mixing 
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Fig. 3. Spatial structure of column integrated meridional mass flux 
averaged over the period June-July-August for (A) G4R20 and 
(B) G5F05. (C) Difference in meridional mass flux, or (B) mi- 
nus (A). 


Fig. 4. Spatial structure of column integrated meridional eddy CCD 
transport averaged over the period June-July-August for (A) G4R20 
and (B) G5F05. (C) Difference in eddy transport, or (B) minus (A). 


in GEOS-5 traps CCD near the surface (Fig. 2c), causing 
a stronger negative (positive) vertical CO 2 gradient in the 
lower troposphere (t]= 1-0.9) in the annual (summer) av- 
erage. Since cold fronts and moist conveyor belts embedded 
in synoptic storms move CCD upward and poleward (Para- 
zoo et al., 2011), it follows that stronger vertical gradients in 
GEOS-5 enhance poleward eddy CO 2 transport. 

Differences in meridional wind associated with the com- 
bined (and non-linear) effects of sub-grid vertical transport, 
grid spacing, and data assimilation also cause differences 
in eddy CO 2 transport (Fig. 3). Meridional mass fluxes in 
GEOS-4 and GEOS-5 have similar spatial patterns during 
boreal summer in northern mid-latitudes, but have regional 
differences in magnitude, especially in in regions of equa- 
torward transport (blue shading). With the exception of en- 
hanced poleward transport in eastern Europe (25-50° E), 
equatorward and poleward mass fluxes are typically weaker 
in GEOS-5. Nevertheless, eddy CO 2 transport is generally 
stronger in GEOS-5 throughout mid-latitudes (Fig. 4), sug- 
gesting that enhanced vertical CO 2 gradients associated with 
weaker sub-grid vertical mixing are the primary driver of en- 
hanced eddy CO 2 transport in GEOS-5. 


3.2 Source/sink inversion 

The effect of differences in eddy transport on flux retrieval is 
tested in the following series of inversion experiments, start- 
ing with simplified “perfect transport” experiments. First, 
when all retrievals are retained (Experiment 1), we find that 
total annual flux errors, representing the difference between 
the recovered and true flux, are negligible (< O.lPgCyr \ 
Fig. 5a) and initial uncertainty is reduced by 50% (Fig. 5b), 
indicating MLEF inversion is robust to numerical errors. Sea- 
sonal errors are also small and mostly random (Fig. 6a). 
These baseline errors are not sensitive to the initial distri- 
bution of ensembles or ensemble size (50-200 ensembles), 
and do not increase when synthetic retrievals are randomly 
screened. For example, removal of 67 % of data using a ran- 
dom filter produces spatial patterns similar to Fig. 6a. 

Screening specifically for retrievals with cloud opti- 
cal depth greater than 0.3 (Experiment 2) also eliminates 
more than 67 % of observations in many regions; how- 
ever, such systematic screening leads to flux errors of up 
to 0.25PgCyr -1 in the tropics and 0.19PgCyr _1 in Eu- 
rope (Fig. 5, medium blue), which in northern mid-latitudes 
are most prominent during boreal summer (Fig. 6). Flux er- 
rors in N. America also increase, but carry a dipole pattern 
and tend to cancel in the annual mean. We note that, while 
flux errors increase in Experiment 2, they are not statisti- 
cally significant from errors in Experiment 1; for example. 
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Fig. 5. Bar plots of total annual terrestrial flux error (A) and un- 
certainty (B) for Experiments 1-6, aggregated up to regional and 
global scale. 


the difference between flux errors in Europe in Experiment 
1 and 2 10.14PgCyr ') is well within the uncertainty in the 
difference (0.32 PgC yr -1 ). Nevertheless, these results sug- 
gest that the size and spatial distribution of flux errors are 
more sensitive to the type, in contrast to quantity, of data re- 
moved. 

Summertime flux errors in Europe and N. America 
(Fig. 6b) tend to be distributed in a way that aligns with pat- 
terns of poleward eddy CO 2 transport by G4R20 (Fig. 4a). 
For example, high CO 2 air is transported into the east half of 
N. America during summer through two mechanisms: up- 
ward and poleward moving moist conveyors in the south 
(red shading in the Southeast), which advect high CO 2 sub- 
tropical air residing in the lower troposphere, and down- 
ward and equatorward moving dry air intrusions from the 
north, which advect high CO 2 arctic air from the upper tropo- 
sphere. Plots of column integrated meridional mass flux illus- 
trate these air streams (Fig. 3a). While equatorward-moving 
air masses are typically dry and well sampled by satellites, 
poleward-moving air is moist and cloudy, causing an esti- 
mated 30 % of the underlying CO 2 air mass to be hidden in 
clouds from satellites (Parazoo et al., 201 1), leading directly 
to a temporal sampling bias in the absence of surface obser- 
vations. Although both air masses are high in CO 2 , equator- 
ward transport from the north is stronger. The air sampled 
by satellites is therefore enriched in CO 2 relative to average 
conditions, and the inversion compensates by creating a CO 2 
“source” over N. America. A similar process of strong pole- 
ward transport of high CO 2 air in moist conveyors is respon- 
sible for the summer “source” in eastern Europe. 

Despite these sampling biases, 80% of the global 3.0 PgC 
sink represented by slowly varying processes in middle and 



Fig. 6. Flux errors during boreal summer (June-July-August) in (A) 
Experiment 1 and (B) Experiment 2. 


tropical latitudes is recovered (Experiment 3, Fig. 5, cyan). 
Perturbing the sinks with spatial noise and seasonality does 
not degrade recovery (Experiment 4, Fig. 5, yellow), nor does 
removing glint retrievals exceeding ± 20° latitude of solar 
declination (with the exception of slightly enhanced source 
in northern Europe and reduced source in temperate S. Amer- 
ica, not shown). The inversion also recovers the spatial dis- 
tribution of sinks in N. America and tropical regions (Fig. 7). 
While the effect of the temporal sampling bias is clear in Eu- 
rope, and reduces sink recovery in N. America, Experiments 
3 and 4 illustrate that the bias recovery approach can separate 
slowly varying signals contained in column integrated CO 2 
from high frequency signals, and then attribute these signals 
to the correct processes. 

In “biased” transport OSSE’s (Experiments 5 and 6), 
most regions experience an increase in total annual flux 
bias relative to baseline errors established in the equiv- 
alent “perfect” transport experiments (Experiments 2 and 
4, respectively; see Fig. 5). For example in Experi- 
ment 5, Europe (0.57 ± 0.25 PgC yr -1 ), Eurasian Tem- 
perate (0.32 ± 0.23 PgC yr -1 ), and S. America Tropical 
(0.21 ± 0.71 PgC yr -1 ) contribute to an enhanced global 
source (0.94 ± 1.23 PgC yr -1 ), while N. America temper- 
ate and boreal regions contribute to an enhanced sink 
(0.42 ± 0.45 PgC yr -1 ). Eastern N. America reverses from a 
weak source in Experiment 2 (Fig. 6b) to a strong sink in 
Experiment 5 (Fig. 8). The global land source increases to 
1 .4 PgC yr - 1 in Experiment 6 when slowly varying sinks are 
added (Fig. 5). The magnitude of flux errors in Experiment 6 
is similar to Experiment 5 and consistent with Chevallier et 
al. (2010). 

The difference between flux errors in perfect transport 
(Experiment 2 and 4) and biased transport (Experiment 5 
and 6) experiments primarily reflect differences in transport. 
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Fig. 7. (A) True and (B) recovered fluxes for Experiment 3, summed 
over one year. 
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Fig. 8. Flux errors in Experiment 5, averaged over (A) 1 yr and 
(B) June- July-August. 


poleward mass fluxes, and hence poleward eddy CO 2 trans- 
port, is enhanced in eastern Europe in G5F05. This amplifies 
the fraction of high CO 2 air hidden from satellites and there- 
fore amplifies the “source” found in Experiment 2. 


Differences in the calculation of the dry air mole frac- 
tion of CO 2 due to differences in humidity fields be- 
tween GEOS -4 and GEOS -5 are small in northern mid- 
latitudes (< 0.1 ppm at grid scale in the annual mean) 
and unlikely to contribute significantly to flux errors in 
N. America and Eurasia. With Experiment 6 as the ref- 
erence and focusing on northern temperate and boreal re- 
gions, we therefore estimate that transport errors create a 
European source of 0.43 ± 0.35 PgC yr -1 , Eurasian Temper- 
ate source of 0.15 ±0.32PgCyr _1 , N. American Temperate 
sink of 0.04 ±0.45 PgC yr -1 , and N. American Boreal sink 
of 0.15 ±0.20 PgC yr -1 . The amplified European source is 
most significant relative to uncertainty between inversions. 

Although high uncertainty between inversion experiments 
reduces the significance of source/sink amplification, these 
experiments reveal important sensitivities of flux estimates 
to transport patterns. In particular, systematic differences in 
eddy transport amplify temporal sampling bias in Europe and 
reverse the sign of bias in N. America. During N. Amer- 
ica summer, G5F05 reduces poleward mass flux in the south 
(relative to G4R20) and equatorward mass flux in the north 
(Fig. 3c). While both processes decrease transport of high 
CO 2 into N. America, reduction of equatorward transport of 
high CO 2 air exceeds reduction of poleward transport (blue 
shading in N. America, Fig. 4c). As a result, synthetic satel- 
lite data is depleted in CO 2 , in contrast to Experiment 2, and 
the inversion creates a “sink” over N. America. Meanwhile, 


4 Conclusions 

Upward and poleward frontal CO 2 transport feeds off the 
background vertical CO 2 gradient, which is modulated by 
sub-grid vertical transport processes such as cumulus con- 
vection and turbulent diffusion. Air masses transported by 
frontal processes are moist and contain systematically differ- 
ent CO 2 signals from equatorward-moving dry air masses. 
The implication for source/sink estimation is two-fold. First, 
CO 2 signals contained in moist air masses are hidden from 
orbital sensors on satellites. This causes small but system- 
atic errors in regional scale flux retrieval in northern mid- 
latitudes. Second, moist processes are represented differently 
in GEOS-4 and GEOS-5, leading to differences in the verti- 
cal CO 2 gradient and hence the fraction of moist CO 2 air 
hidden from satellites. Thus, differences in vertical transport 
amplify sampling biases, which are then aliased in the inver- 
sion through increases in continental scale flux retrieval er- 
rors throughout northern mid-latitudes. Specifically, the Eu- 
ropean source is amplified by 0.43 ± 0.35 PgC yr -1 while a 
weak source in eastern N. America is replaced with a strong 
sink. These results, cast from the perspective of moist frontal 
transport processes, support previous arguments that the ver- 
tical gradient of CO 2 is a major cause of errors in source/sink 
inversions. 

While these results are important for the interpretation of 
flux errors associated with satellite observations of CO 2 , we 
caution these errors in no way bracket the possible range of 
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flux errors to be expected in an inversion of real data. We 
have eliminated important factors such as transport model, 
assimilation system, and differences in specific humidity 
fields between GEOS-4 and GEOS-5, but have not addressed 
errors due to other differences in experimental setup, in par- 
ticular the calculation of XCO 2 . For example, we have not 
considered aerosol effects, land surface type, or surface pres- 
sure. In the absence of condensation, aerosol effects may be 
important for frontal transport in urban areas in Europe and 
eastern N. America or in regions of biomass burning. High 
aerosol burden could have a similar impact on flux inversions 
as clouds, and should therefore be a focus of future studies. 

We also caution that, while signal detection experiments 
suggest that two weeks of data assimilation may provide a 
sufficient constraint for flux inversion of satellite data, our 
results also suggest that it may contribute to flux biases. If 
CO 2 is mixed rapidly through the domain then we should 
expect that flux retrieval is not sensitive to the timescale for 
data assimilation. If, however, the timescale for transport is 
long compared to the timescale at which CO 2 is sampled, 
then biases may be introduced. If the time-scale for merid- 
ional transport is greater than two weeks, then a longer win- 
dow may be preferable. Our experiments assume that greater 
sampling coverage of satellites would permit shorter assim- 
ilation windows; we did not consider, however, that tempo- 
ral sampling errors associated with moist frontal transport 
would bias the flux inversion. Future studies should more 
thoroughly examine sensitivity of satellite inversions to data 
assimilation window. 

There is little doubt in the CO 2 inversion community 
that priority should be given to improving sub-grid verti- 
cal transport. Despite efforts to treat transport error, includ- 
ing joint inversion of column/surface CO 2 and meteorologi- 
cal data (Chevallier et al., 2011; Kang et al., 2011) and im- 
plicit accounting for transport errors through Monte Carlo 
and ensemble approaches (Chavallier et al., 2007; Teixeira 
et al., 2008; Liu et al., 2011), it is not clear to what extent 
these techniques fix or exacerbate transport errors. It seems 
that only better characterized model parameterizations, de- 
veloped based on extensive in situ meteorological measure- 
ments, will help address transport models. 
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